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We derive the equations of motion describing the feedback control of quantum systems 
in the regime of "good control" , in which the control is sufficient to keep the system 
close to the desired state. One can view this regime as the quantum equivalent of 
the "linearized" regime for feedback control of classical nonlinear systems. Strikingly, 
while the dynamics of a single qubit in this regime is indeed linear, that of all larger 
systems remains nonlinear, in contrast to the classical case. As a first application of 
these equations, we determine the steady-state performance of feedback protocols for a 
single qubit that use unbiased measurements. 
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1 Introduction 

The process of real-time feedback [l][2l[3] from a continuous measurement [4l[5] is a potentially 
important tool for obtaining precise control of noisy quantum systems [6l [7l l8l l9l [TOl ITTl [T2[ [T3[ 
[ll[l3[ll[17l[ia[Tl[20l[2ll[2l[2l[24]. In general the dynamics of a continuously observed 
quantum system is nonlinear [4]. Because of this, it is difficult, if not impossible, to obtain 
analytically, or even numerically, fully optimal feedback protocols for most quantum systems. 
One therefore wishes to find ways to simplify the problem, yet still obtain useful results. 

In considering feedback control of nonlinear classical systems, if one assumes that the 
deviations from the "target" state about which one is stabilizing the system are small, then the 
dynamics about this state are approximately linear. Thus the assumption of small deviations 
about the target state (that is, that the controller is able to effect good control) effectively 
"linearizes" the dynamics, allowing the application of the optimal control results for linear 
systems [25] . 

We now ask the question, if we consider an equivalent regime for quantum systems, that 
of "good control", does this simplify the dynamics of the feedback control process, and might 
it allow us to obtain optimal feedback protocols? Two previous works have examined this 
regime, but only in combination with "strong feedback" ([20l [26], see also [3]). This is 
the additional assumption that the Hamiltonian at the disposal of the controller can induce 
dynamics sufficiently fast compared to that of the noise and the measurement, that it can be 
assumed infinite. An optimal control protocol was obtained for a qutrit in this regime in [26] . 
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Since the regime of good control is one in which the majority of control systems will wish 
to operate, one would ideally like to drop the restriction of strong feedback, and obtain results 
that are applicable to all systems in the regime of good control. We achieve this by deriving 
the equations of motion for both the eigenvectors and eigenvalues of the density matrix under 
a continuous measurement of an arbitrary observable. The regime of good control can then 
be specified in terms of these eigenvalues and eigenvectors. 

As a first application of these equations we consider the simplest case, that of a single qubit, 
and derive an expression for the steady-state performance of feedback when the measurement 
is continually adapted so that the measured observable remains unbiased with respect to the 
density matrix. This has been shown to be the optimal measurement strategy in the limit 
of strong feedback [571 UHl HSl US]- In the steady-state, the performance of the feedback 
algorithm is quantified by the steady-state probability, Pgs, that the system will be found in 
the desired state (the target state). We derive an explicit expression for Pgs for the optimal 
linear feedback, for spontaneous decay and dephasing of the qubit. 

In the following section we derive the equations of motion for the eigenvalues and eigen- 
vectors of the density matrix under a continuous measurement of an observable, and use these 
to determine the equations of motion for feedback control in the regime of good control. In 
Section III we apply these to a specific problem for a single qubit. Section IV concludes with 
a discussion of the implications of these results. 

2 The Equations of Feedback Control 
2.1 Eigenvectors and Eigenvalues 

We first derive the equations of motion for the eigenvalues and eigenvectors of the density 
matrix under a continuous measurement of an arbitrary observable X. The evolution of 
the density matrix under such a measurement is given by the stochastic master equation 
(SME) 011] 

dp ^ -k[X,[X,p]]dt + V2k{Xp + pX-2{X)p)dW, (1) 

where p is the system density matrix, and fc, often referred to as the measurement strength, 
characterizes the rate at which the measurement extracts information. 

To obtain the equations of motion for the eigenvalues and eigenvectors, we need to know 
how they change to first order in dt. It turns out that time-independent perturbation theory 
is the perfect tool for this task. The density matrix at time t + dt is 

p{t + dt) = p{t) + dp. (2) 

Recall that time-independent perturbation theory is a method for determining the eigenvalues 
and eigenvectors of a Hamiltonian H , when 

H^Ho + XV, (3) 

the eigenvalues and eigenvectors of Hq are known, and A is a sufficiently small parameter |30) . 
The solution is given as a power series in A. 

Identifying p(t) with Hq and dp with XV , time-independent perturbation theory gives us 
the eigenvalues and eigenvectors of p{t + dt) in terms of p{t), which is what we need. In 
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calculating these eigenvectors and eigenvalues, it is important to note that since dp includes 
the stochastic Ito increment, dW , and since dW^ = dt, we must use the perturbation theory 
to second order in A. Note that while second-order perturbation theory would usually give an 
approximation to the true dynamics, in this case it generates the exact equations of motion 
for the eigenvectors and eigenvalues of p. 

In the following we will denote the eigenvalues of p{t) as A„(i). We will also write the 
eigenvectors of p at time t + dt in terms of the eigenvectors of p at time t. Denoting the former 
by \n{t + dt)), and the latter as \n{t)), we thus write 



\n{t + dt)) 



N-l 
J=0 



dCjn)\jit)). 



(4) 



Finally, we will denote the matrix elements of X in the eigenbasis of p{t) as Xjk; that is, 
Xjk = {j{t)\X\k{t)) . With this notation the equations of motion are 



dXr, 



8kdt 



E 



A; 



An — A; 



\Xnl 



Xn + VSkdW (^Xnn - E ^'^"^ ^" 



(5) 



dcjn — Akdt 



-kdt 



Xjn 



E 



1 A„(A„ + Xj) 
4" (A„-A,)2 



Xn 



A?, — A^ 



(A„ - Xj ) 



' Aj- + A„-2A; _ 2(Aj- + A;)(A; + A„) 
, (A„ — Aj) (A„ — A;)(A„ — Aj) 

An + Xj 



2kdW ( 1ZL__Z I Xjn, n ^ j 



XjlXin 



dc„ 



-kdt 



An — Xj 



{Xn+Xl)' 



7-; rT^\Xnl 



(An - Xlf 



(6) 
(7) 



This set of equations makes explicit the complexity of the dynamics of continuous measure- 
ment. 



2.2 Feedback Control 

We now turn to feedback control, in which the controller continually modifies the Hamiltonian 
of the system, based upon the measurement results, in order to bring the state of the system 
close to a "target" state The target state could change with time, but we will restrict 
ourselves here to a fixed target state. This loses little, as it is straightforward to modify the 
following equations to take into account an evolving target state. We will denote the total 
Hamiltonian of the system by H . Usually H breaks down naturally into the sum Hq + H{h{t), 
where H{h{t) is the part that the controller can modify. At time t, the Hamiltonian Hfh{t), 
and thus H{t), is some function of the measurement record up until that time. It can be 
shown that optimal control can always be realized by choosing H{t) to be a function of p{t) 
(which is itself obtained from the measurement record) |31| . 

While it is the target state that remains fixed, and the system density matrix that evolves, 
one can always view the dynamics as happening the other way around, since the first case is 
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related to the second by a unitary transformation. With this picture in mind, we wiU write 
the target state, in terms of the eigenvectors of p{t). Thus 



N-l 



1^) = znit)\n{t)). 



(8) 



k=0 



The equations of motion above for the Cj„ aUow us to determine the equations of motion of 
the Zn under the continuous measurement. To do so we note that since the transformation 
from {nit)) to \n(t + dt)) is unitary (to first order in dt), the inverse transformation is given by 
the Hermitian conjugate: \n{t)) = J2j=o^i^nj + dc^j)\j (t + dt)) . Substituting this into Eq.® 
gives the equations of motion for the z„, which are 



(9) 



To describe feedback control we must include, in addition to the dynamics induced by 
the measurement, the dynamics induced by the noise from the environment, as this is the 
reason that one needs to use feedback control in the first place, and the dynamics due to the 
Hamiltonian, H{t). The noise effects both the eigenvalues of the density matrix as well as the 
Zn, but the Hamiltonian, and thus the feedback, only influences the Zn- 

Environmental noise is usually described well by the Lindblad master equation |32| 



(10) 



where L is an arbitrary operator. We can have as many terms of the above form as we wish, 
so that the effect of the environment is more generally given by 

Pcnv = - XI + pL('")^L(") - 2L("Vi^"'^) ■ (11) 

m 

Using time-independent perturbation theory as above, the dynamics induced by Eq. (|10[) is 



At, 



-27 



^nDnn — Afc|L,; 



n 



^kL*i.Lnk 
An — Xi 



(12) 

(13) 
(14) 



where we have defined Djn — X^A; ^kj^kn- Finally, the dynamics induced by the Hamiltonian 



^j7i — ■ 



(15) 



The quantity that we want feedback control to maximize is the probability, P, that the 
system will be found in the target state. This is 



P = {i^Wm =Y,W?^n. 



(16) 



J. Li and K. Jacobs 5 



The performance of feedback control is thus completely described by the evolution of the A„ 
and the Zn- However, this evolution may also depend upon the dynamics of the elements 
of the eigenvalues not captured by the z„. This is bccaiisc the equations of motion for the 
\n and Zn depend on the noise operator L, and all our operators are written in the basis of 
the density matrix. As this basis evolves, the matrix elements of L also undergo an effective 
evolution. (The matrix elements of H and X will also evolve in the same way, if they are not 
being continually specified by the controller.) The equation of motion for the elements of an 
arbitrary operator A in the density matrix eigenbasis are 

dLmn = ^ ] {LjndCmj + L^jdc^j) + ^ ^ LjkdCmjdc^j.. (17) 

So, in addition to the motion of A„ and z„, we must include the evolution of the elements 
of the noise operator(s) in our feedback control problem. In general this also means that 
the A„ are coupled to the z„. In the most general case, including the effective motion of 
the decoherence operators is equivalent to including the motion of all the eigenvectors of 
the density matrix. It is nevertheless useful to write the operators, as we have above, in 
the density matrix eigenbasis. This is because in certain cases, for example if the noise is 
isotropic (the same in all directions in Hilbert space), the noise is basis independent, and so 
the problem reduces entirely to the dynamics of the A„ and Zn- 

2.3 Specifying the Regime of Good Control 

The regime of good control is defined as that in which the feedback protocol keeps the system 

very close to the target state throughout the evolution. That is, the probability that the 
system will be found in the target state, P, is close to unity throughout the evolution. This 
is a regime that many feedback control systems will wish to operate in, especially if they are 
designed to operate in the steady state, and is thus an important regime. The utility of writing 
the equations of motion of feedback control in terms of the eigenvectors and eigenvalues of p 
is that it allows us make the approximation P = 1 — s, with e ^ 1. To do this we first note 
that P can only be close to unity if both Zn and P„ are close to unity, for the same value 
of the index n. Ordering the eigenvalues in decreasing order, this means that Aq = 1 — A, 
Zo = 1 — S, where A and 6 are small. (Note that here we have assumed that zo is real. We 
will justify this assumption below.) 

We now choose to specify the regime of good control so that the small eigenvalues, and 
the small coefficients z„, n > 1, are of the same order. Since Izq]"^ + = 1) this 

means that, to first order in 6 

'5 = ^Nn|'~A2, n>l. (18) 

This means that, in expanding the equations of motion to first order in A, we can set 5 = 0. 
This is not the only choice we can make, but it is the one that results in equations of motion 
that are linear in the Zn- We will return to discuss this choice in the final section of this 
paper. The probability that the system is in the target state, P, becomes 

P « l^ol^Ao « 1 - A. (19) 

Now we see why we can assume that zo is real: if 5 = 0, then zq is constant, and we are free 
to choose the global phase of the target state so that zo is initially real. 
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We now expand the equations of motion of the eigenvalues to first order in A, and the 
result is 



dXn = —Xni Skdt 
for n > 1, and 



+ V8kdW{Xoo - X„„) > (20) 



dA = J2 = -Sfcdi ^ X„\Xon\^ + VskdW AXoo - ^ A„X„„ . (21) 

The purpose of expanding the equations of motion to first order in A is to simplify them. 
Since the equations of motion for the Zn are already linear in the Zn, the wisdom of applying 
the expansion to them is not so clear. However, this expansion does do two things. The first 
is to reduce the number of variables by eliminating zq. The second is that it removes some 
nonlinear terms of the form XjZ^- The resulting equations of motion for the z„ are 



dzn = kdt 



J#0 



2kdWX, 



n 



-Xr, \ kdt 



4:Xqo + XojXjn 



SkdWXor, 



4Xon(XoO ~ ^jj) + XojXjn — 2 XokXkn 



8kdWXor, 



-Zn ( kdt 



(A„ - Xj) 



2 r^'"-J I 



- z,F„,iX,X) 



(22) 



for n > 1, where 
Fnj (A, X) 



-kdt 



3(Aj + A„) 
(Aj - A„) 



XmXot, 



-kdt Y 



(A„ — Xk) + {Xj — Xk) 2(A„ + Xk){Xj + Afe) 



{Xj - A„) 



{Xj — A„)(Aj — Afe) 



XjoXon 



-Akdt 



A,(A,+A„) ^ {^ K) ^ ^ 



(A,-A„)2 4;-«-". 



2kdW 



Aj + A„ 



(23) 



^3 _ 

In the regime of good control, the dynamics due to the environment (Eqs.([T2|) - ([14])) 
becomes 



Xn = 27 



|ino|^ — XnDnn — Aj(|Lno|'^ — \Lnj\'^) 



(24) 
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where, as above, = L^^^Lkn- Finally, the dynamics due to the Hamiltonian is 



When using the above equations, Eas. (|20|) - p?| . it is important to remember that the ele- 
ments of all operators, the measured observable, X, the Hamiltonian, H, and the decoherence 
(Lindblad) operator L, are given in the basis of the eigenvectors of p. So long as X and H 
are determined by the feedback protocol, it does not matter which basis one writes them 
in. However, for the noise operator L, and for the measured observable if it is fixed, then 
the elements of these operators are not constant in the density matrix eigenbasis, but are 
determined by the evolution of the eigenvectors, as describe above. 

3 A Single Qubit: Feedback Control with Unbiased Measurements 

To calculate the steady-state performance of a feedback protocol, we need to solve for the 
steady-state values of A„, averaged over all trajectories. The differential equations for A„ are in 
general coupled to those for the z„. While the equations of motion for the z„ are linear, those 
for the eigenvalues are only linear if the system has two states. In addition, if the dimension of 
the system is larger than two, in general the equations of motion have multiplicative noise (the 
noise multiplies the variables themselves). For linear systems driven by additive noise, general 
and exact results exist for feedback control, providing both optimal [2l [25j and robust |33] 
control protocols. So we see that in the regime of good control, these exact results can be 
applied to a single qubit, but not to higher dimensional systems. 

We will now use the equations derived in the previous section to calculate the performance 
of feedback control for a single qubit, where the measured observable is continually adjusted 
so that it is always unbiased with respect to the eigenbasis of the density matrix. This kind 
of "unbiased" measurement has a special property; it eliminates the stochastic terms in the 
equations of motion for the eigenvalues. To see why we first note that the diagonal matrix 
elements of an observable that is unbiased w.r.t the density matrix eigenbasis are all equal 
in this basis. Further, the master equation that describes the measurement of X, Eq.([2|), is 
invariant under the transformation X X + al, where / is the identity operator and a is any 
real number. This means that we can always choose X so that Tr[X] = 0. With this choice, 
the diagonal elements of an unbiased X are all zero, and this eliminates the stochastic terms 
from equations for A„. An unbiased measurement has also been shown to be the optimal 
measurement for feedback control in the regime of strong feedback [26j . 




For a single qubit, since the diagonal elements of X are zero, X is specified by only one 
complex number, Xqi- An examination of the eqation of motion for zi shows that the phase 
of this complex number only serves to redistribute the noise between the real and imaginary 
parts of zi , and thus does not affect the resulting performance of the feedback protocol, we 
chose Xqi to be real. This means that we can completely absorb Xqi into the measurement 
strength fc, and thus set Xqi = 1. The equations of motion for the qubit in the regime of 




(26) 
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good control are then 

Ai = 2jLlo-{8k + 2j[Ll^ + Ll„])Xi+ijLio{Ln-Lon)zi (27) 
dzi = [-'-f{Dio-2LQQLia)-iuHio]dt-2j{Dio- L^^Lii+Lg„Lio)Xidt 

-{k + -fDii+iuHii)zidt + V2kdW, (28) 

Here we have also extracted an overall rate constant, u, from the Hamiltonian, so that the 
Haniiltonian matrix elements, Hj]^, are now dimensionless. 

Since the density matrix eigenbasis is determined entirely by zi , the elements of the noise 
operator L that appear in the equations above are, to first order in A, 



-^00 


— Lqo + LqiZi + LiqZj^, 


Lqi 


= -LqqZi + 


Liiz* + Lqi 


Lio 


= -LqqzI + 


LiiZi + LiQ 


Lii 


— Lii — Lqi 


z\ - LiQZi, 



where the Ljk are constant. 

We wish to obtain the steady-state solutions to Eqs. (p7|) and ((28|) . We now simplify 
the analysis by specializing to the important and widely applicable case in which the qubit 
undergoes both spontaneous decay at rate F, for which 

and dephasing at rate 7, for which 

l-f 

X 1 

The dynamical equations for the feedback control problem reduce to 



^=1"? n )• (30) 



Ai = -2(4fc + r + 27)Ai + 2(r + 7), (31) 
dzi = -{k + j-iuHii)zidt-iuHiQdt + V2kdW, (32) 

Now the equations have simplified, in that Ai is decoupled from zi. We can immediately 
obtain the steady-state value of Ai, which is 

T + -y 

Af = ZI . (33) 

^ 4fc + r + 27 ^ ' 

The performance, in this case of all feedback control algorithms that use unbiased measure- 
ments, in the regime of good control, is therefore 

Pss^l- Xf = ■ (34) 

J- + 4fe+7 

However, this performance will only be achieved so long as the feedback is sufficient to keep 
the system in the regime of good control. To determine the conditions under which this is 
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true we must solve the equation of motion for zi. Since this equation is stochastic, this means 
determining the steady-state values of the mean and variance of zi. To do this we must first 
determine the optimal feedback Hamiltonian. We note from Eq. ([32| that the measurement 
only introduces noise into the real part of zi . So once the feedback Hamiltonian has been used 
to damp Im[zi] to zero it will stay zero, and so in the steady-state the optimal Hamiltonian is 
just that which damps the real part of zi as fast as possible. For a fixed maximum feedback 



r2 

feedback, which means that H (x zi, the feedback Hamiltonian is then 



strength /i = Tr[_ff^], this just means choosing H to be proportional to ay. Choosing linear 



H = Rc[zi]o-y, (35) 

and the resulting equation of motion is 

dRe[zi] = -(fc + 7 + u)Rc[zi]dt + V^kdW. (36) 

This stochastic differential equation is known as an Ornstein-Uhlenbeck equation, and solving 
it is straightforward using the standard techniques of Ito calculus [SU |4] . The steady-state 
mean and variance are 

(Re[zi])ss = 0, (37) 
V;.(Re[zi]) = J" , (38) 

where V^s denotes the variance. The conditions under which optimal linear feedback control 
preserves the regime of good control is Ai <C 1 and Y^K5s(Re[zi]) ^ 1. Using Eqs. (|33|) and 
(l38l) these conditions become 



k > r-h7, (39) 
Vil > \/fc. (40) 

This is the complete solution to the steady-state feedback control problem for a single qubit 
undergoing decay and/or dephasing, with feedback from an unbiased measurement, in the 
regime of good control. 

4 Discussion 

In this work we have defined the regime of good control as that in which, for the duration 
of the control, the relationship between the target state, and the system density matrix, 
P = E«=o^ A„|n)(n|, is given by 

N-l 

|^) = (l-A)|0) + 5]z„|n), (41) 

n=l 

where 

A„ - A < 1, n>l (42) 
z„ - A < 1, n>l. (43) 
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That is, we have treated the eigenvalues of the density matrix, and the coefficients of the 
target state in the density matrix eigenbasis, z„, on the same footing regarding the small 
parameter A. This level of approximation preserves the linearity of the equations of motion 
for the coefficients z„ , and thus gives us the simplest description of feedback control. 

However, if we take a look at the conditions upon the measurement strength, fc, and 
feedback strength, m, required to satisfy the regime of good control for a single qubit (Eqs.(f39|) 
and (|40|) ). we see that these conditions do not give measurement and feedback quite the same 
status. The condition given by Eq. (l39p . coming from the requirement on the eigenvalues, 
Ea. (|l^ . involves a ratio of the rate constants. But the condition Eq. (|l(I)l . coming from 
the requirement on the coefficients, Ea. (|15)l . involves a ratio of the square roots of the rate 
constants. This means that the condition z„'^A<Cl,n>lisa stronger requirement than 
the eigenvalue condition A„^A^l,n>l. This comes from the fact that it is the variance 
of the coefficient zi that is proportional the ratio of the rate constants, and not the standard 
deviation of zi . 

The above analysis implies that to impose the same level of constraint upon the eigenvalues 
and the coefficients, we must instead place the same requirement on the eigenvalues as we do 
on the square moduli of the coefhcients. That is 

|z„|2 ^ A«l, n>l. (44) 

This less restrictive regime does not give linear equations for the coefficients z„, but instead 
gives equations that depend also upon z^. We will not investigate this regime further here, 
but we feel that it is an interesting subject for future work. 

Returning to the regime of good control as we have defined it here, we can summarize 
the results as follows. The dynamics is described by two sets of variables, the eigenvalues 
of the density matrix (giving — 1 independent real variables), and the elements of the 
eigenvectors (giving another iV^ independent real variables [S])- How many of the elements of 
the eigenvectors are actually required depends on the noise, however. If the noise is completely 
independent of the eigenvectors (isotropic), or the system is a single qubit, then only the 
coefRcients of the target state in the basis of the eigenvectors are required to describe the 
dynamics. In this case there are a total of only 2{N — 1) real dynamical variables, where N 
is the dimension of the system. 

The equations of motion for the eigenvectors (and thus the coefficients of the target state, 
and the elements of the noise operators) are linear, but the equations of motion for the 
eigenvalues are nonlinear for every system with dimension higher than two. This is in contrast 
to the regime of good control for classical systems. As the dimension of the system increases, 
the nonlinearity of the equations of motion for the eigenvalues increases, in that higher powers 
of the eigenvalues appear in these equations. This means that obtaining steady-state solutions 
to these equations involves solving increasingly high order polynomials as the system size 
increases. In addition to being nonlinear, the equations of motion also contain multiplicative 
noise for dimensions higher than two. 

In the example that we analyzed for a single qubit, we have considered only optimizing the 
feedback while fixing the measurement strategy. Since the equations of motion in the regime 
of good control are linear for a single qubit, we expect that analytic expressions can be derived 
giving fully optimal feedback protocols for arbitrary noise, using results from classical control 
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theory. Deriving and exploring these protocols is a natural topic for further work. 
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